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Long paths and Hamiltonicity in random graphs 


Michael Krivelevich * 


Abstract 

We discuss several classical results about long paths and Hamilton cycles in random graphs 
and present accessible versions of their proofs, relying on the Depth First Search (DFS) algorithm 
and the notion of boosters. 

1 Introduction 

Long paths and Hamiltonicity are certainly between the most central and researched topics of modern 
graph theory. It is thus only natural to expect that they will take a place of honor in the theory 
of random graphs. And indeed, the typical appearance of long paths and of Hamilton cycles is one 
of the most thoroughly studied directions in random graphs, with great many diverse and beautiful 
results obtained over the last fifty or so years. 

In this survey we aim to cover some of the most basic theorems about long paths and Hamilton 
cycles in the classical models of random graphs, such as the binomial random graph or the random 
graph process. At no means this text should be viewed as a comprehensive coverage of results of 
this type in various models of random graphs; the reader looking for breadth should rather consult 
research papers, or a forthcoming monograph on random graphs due to Frieze and Karohski [8]. 
Instead, we focus on simplicity, aiming to provide accessible proofs of several classical results on the 
subject, and showcasing the tools successfully applied recently to derive new and fairly simple proofs, 
such as applications of the Depth First Search (DFS) Algorithm for finding long paths in random 
graphs, and the notion of boosters. 

Although this text should be fairly self-contained mathematically, basic familiarity and hands-on 
experience with random graphs would certainly be of help for the prospective reader. The standard 
random graph theory monographs of Bollobas [5] and of Janson, Luczak and Rucihski [9] certainly 
provide (much more than) the desired background. 

The text is based on a mini-course with the same name, delivered by the author at the LMS- 
EPSRC Summer School on Random Graphs, Geometry and Asymptotic Structure, organized by 
Dan Hefetz and Nikolaos Fountoulakis at the University of Birmingham in the summer of 2013. The 
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author would like to thank the course organizers for inviting him to deliver the mini-course, and 
for encouraging him to create lecture notes for the course, that eventually served as a basis for the 
present text. 


2 Tools 

In this section we gather notions and tools to be applied later in the proofs. They include standard 
graph theoretic notation, asymptotic estimates of binomial coefficients, Chebyshev’s and Chernoff’s 
inequalities, basic notation from random graphs (Section 12.11) . as well as algorithmic and combina¬ 
torial tools — the Depth First Search (DFS) algorithm for graph exploration (Section 12.2|) . the so 
called rotation-extension technique of Posa, and the notion of boosters (Section 12.31) . 


2.1 Preliminaries 

2.1.1 Notation and terminology 

Our graph theoretic notation and terminology are fairly standard. In particular, for a graph G = 
{V,E) and disjoint vertex subsets U,W C P, we denote by Ng{U) the external neighborhood of U 
in G: Ng{U) = {v & V — U : v has a neighbor in U}. The number of eddes of G spanned by U is 
denoted by edU), the number of edges of G between U and W is edU, W). When the graph G is 
clear from the context, we may omit G in the subscript in the above notation. 

Path and cycle lengths are measured in edges. 

When dealing with graphs on n vertices, we will customarily use N to denote the number of pairs 
of vertices in such graphs: E = 


2.1.2 Asymptotic estimates 


We will use the following, quite standard and easily proven, estimates of binomial coefficients. Let 
1 < X < k < n integers. Then 
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2.1.3 Chebyshev and Chernoff 

Chebyshev’s Inequality helps to show concentration of a random variable X around its expectation, 
based of the first two moments of X. It reads as follows: let X be a random variable with expectation 
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/i and variance cr^. Then for any a > 0, 

Pr[\X - IJ.\ > aa] < . 

The following are very standard bounds on the lower and the upper tails of the Binomial distri¬ 
bution due to Chernoff: If X ~ Bin{n,p), then 

• Pr (X < (1 — a)np) < exp for every a > 0. 

• Pr (X > (1 -|- a)np) < exp for every 0 < a < 1. 

Another, trivial yet useful, bound is as follows: Let X ~ Bin{n,p) and /c G N. Then 

FriX >k)<{^y. 

Indeed, Pr(X > k) < < (^)^ 

2.1.4 Random graphs, asymptotic notation 

As usually, G{n,p) denotes the probability space of graphs with vertex set {1,... ,n} = [n], where 
every pair of distinct elements of [n] is an edge of G ~ G{n,p) with probability p, independently of 
other pairs. For 0 < m < X, G{n, m) denotes the probability space of all graphs with vertex set 
[n] and exactly m edges, where all such graphs are equiprobable: Pr[G\ = (^) . One can expect 

that the probability spaces G{n,p) and G{n,m) have many similar features, when the corresponding 
parameters are appropriately tuned: m = Np; accurate quantitative statements are available, see [3], 
[^. This similarity frequently allows to prove a desired property for one of the probability spaces, 
and then to transfer it to the other one. 

We will also address briefly the model D{n,p) of directed random graphs, defined as follows: the 
vertex set is [n], and each of the n(n — I) = 2N ordered pairs l<z/j<nisa directed edge of 
D{n,p) with probability p, independently from other pairs. 

We say that an event occurs with high probability, or whp for brevity, in the probability space 
G{n,p) if lim„_,.(X) Pr[G ~ G{n,p) G T„] = 1. (Formally, one should rather talk about a sequence 
of events {£n}n and a sequence of probability spaces {G(n,p)}n-) This notion is defined in other 
(sequences of) probability spaces in a similar way. 

Let A; > 2 be an integer, and assume that 0 < p,pi, ■ ■ ■ ,Pk < 1 satisfy (1—p) = Then 

the random graphs G ~ G{n,p) and G' = UiLi have the exact same distribution. Indeed, it 

is obvious that each pair of vertices 1 < i < j < n is an edge in both graphs G, G' independently of 
other pairs. In G, this edge does not appear with probability 1 — p, and in order for it not appear 
in G', it should not appear in any of the random graphs G{n,pi) - which happens with probability 
n(Li(l ~ Pi) = ^ ~ P^ the same probability as in G. This very useful trick is called multiple exposure 
as it allows to generate (to expose) a random graph G ~ G{n,p) in stages, by generating the graphs 
G{n,pi) sequentially and then by taking their union. In case when the last probability pk is much 
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smaller than the rest, it is also called sprinkling - a typical scenario in this case is to expose the bulk 
of the random graph G ~ G{n,p) first by generating the graphs G{n,pi), i = 1,... , k — 1, to come 
close to a target graph property P, and then to add few random edges from the last random graph 
G{n,pk) (to sprinkle these few edges) to finish off the job. 

2.2 Depth First Search and its applications for finding long paths 

The Depth First Search is a well known graph exploration algorithm, usually applied to discover 
connected components of an input graph. As it turns out, this algorithm is particularly suitable for 
finding long paths in graphs, and using it in the context of random graphs can really make wonders. 
We will see some of them later in this text. 

Recall that the DFS (Depth First Search) is a graph search algorithm that visits all vertices 
of a (directed or undirected) graph. The algorithm receives as an input a graph G = {V,E)] it is 
also assumed that an order tt on the vertices of G is given, and the algorithm prioritizes vertices 
according to tt. The algorithm maintains three sets of vertices, letting S be the set of vertices whose 
exploration is complete, T be the set of unvisited vertices, and U = V\{SDT), where the vertices of 
U are kept in a stack (the last in, first out data structure). It initiliazes with 5 = [/ = 0 and T = V, 
and runs till [/ U T = 0. At each round of the algorithm, if the set U is non-empty, the algorithm 
queries T for neighbors of the last vertex v that has been added to U, scanning T according to tt. If 
V has a neighbor u in T, the algorithm deletes u from T and inserts it into U. If v does not have a 
neighbor in T, then v is popped out of U and is moved to S. If U is empty, the algorithm chooses 
the first vertex of T according to tt, deletes it from T and pushes it into U. In order to complete 
the exploration of the graph, whenever the sets U and T have both become empty (at this stage 
the connected component structure of G has already been revealed), we make the algorithm query 
all remaining pairs of vertices in S' = F, not queried before. Figure [T] provides an illustration of 
applying the DFS algorithm. 

Observe that the DFS algorithm starts revealing a connected component C of G at the moment 
the first vertex of G gets into (empty beforehand) U and completes discovering all of G when U 
becomes empty again. We call a period of time between two consecutive emptyings of U an epoch, 
each epoch corresponding to one connected component of G. During the execution of the DFS 
algorithm as depicted in Figure [H there are two components in the graph G, and respectively there 
are two epochs - the first is Steps 1-6, and the second is Steps 7-16. 

The following properties of the DFS algorithm are immediate to verify: 

(Dl) at each round of the algorithm one vertex moves, either from T to U, or from U to S'; 

(D2) at any stage of the algorithm, it has been revealed already that the graph G has no edges 
between the current set S and the current set T; 

(D3) the set U always spans a path (indeed, when a vertex u is added to U, it happens because u 
is a neighbor of the last vertex v in U-, thus, u augments the path spanned by U, of which v is 
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Figure 1: Graph G with vertices labeled is on the left, and the protocol of applying the DFS algorithm 
to G is on the right. Observe that at any point of the algorithm execution the set U spans a path in 
G. 


the last vertex). 

We now exploit the features of the DFS algorithm to derive the existence of long paths in ex¬ 
panding graphs. 

Proposition 2.1 Let k,l be positive integers. Assume that G = (V,E) is a graph on more than k 
vertiees, in which every vertex subset S of size |S| = k satisfies: |A^g(5')| > 1. Then G contains a 
path of length I. 

Proof Run the DFS algorithm on G, with vr being an arbitrary ordering of V. Look at the moment 
during the algorithm execution when the size of the set S of already processed vertices becomes 
exactly equal to k (there is such a moment as the vertices of G move into S one by one, till eventually 
all of them land there). By Property (D2) above, the current set S has no neighbors in the current 
set T, and thus N(S) C U, implying \U\ > 1. The last move of the algorithm was to shift a vertex 
from U to S, so before this move the set U was one vertex larger. The set U always spans a path in 
G, by Property (D3). Hence G contains a path of length 1. □ 

Proposition 2.2 Let k < n be positive integers. Assume that G = (H, E) is a graph on n 
vertices, containing an edge between any two disjoint subsets S,T CV of size |5| = |T| = k. Then 
G contains a path of length n — 2k -\-l and a cycle of length at least n — 4/c + 4. 
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Proof Run the DFS algorithm on G, with tt being an arbitrary ordering of V. Consider the moment 
during the algorithm execution when IS"! = |T| — there is such a moment, by Property (Dl). Since 
G has no edges between the current S and the current T by Property (D2), it follows by the 
proposition’s assumption that both sets S and T are of size at most k — 1. This leaves us with the 
set U whose size satisfies: \U\ > n — 2k + 2. Since U always spans a path by (D3), we obtain a 
path P of desired length. To argue about a cycle, take the first and the last k vertices of P. By the 
proposition’s assumption there is an edge between these two sets, this edge obviously closes a cycle 
with P, whose length is at least n — 4/c + 4, as required. □ 

As we have hinted already, the DFS algorithm is well suited to handle directed graphs too. Similar 
results to those stated above can be obtained for the directed case. Here is an analog of Proposition 
12.21 for directed graphs; the proof is the same, mutatis mutandis. 

Proposition 2.3 Let k < n be positive integers. Let G = {V, E) he a directed graph on n vertices, 
such that for any ordered pair of disjoint subsets S,T C V of size |5| = |T| = k, G has a directed 
edge from S to T. Then G contains a directed path of length n — 2k + l and a directed cycle of length 
at least n — 4A: + 4. 

2.3 Posa’s Lemma and boosters 

In this section we present yet another technique for showing the existence of long paths in graphs. 
This technique, introduced by Posa in 1976 m in his research on Hamiltonicity of random graphs, 
is applicable not only for arguing about long paths, but also for various Hamiltonicity questions. 
And indeed, we will see its application in this context later. 

In quite informal terms, Posa’s Lemma guarantees that expanding graphs not only have long 
paths, but also provide a very convenient backbone for augmenting a graph to a Hamiltonian one by 
adding new (random) edges. The fact that expanders are good for getting long paths is already not 
new to us — Propositions 12.1] and 12.21 are just about this. Posa’s Lemma however quantifies things 
somewhat differently, and as a result yields further benefits. 

We start by defining formally the notion of an expander. 

Definition 2.4 For a positive integer k and a positive real a, a graph G = iV, E) is a {k, a)-expander 
if |-^g(P)| > a\U\ for every subset U C V of at most k vertices. 

By the way of example, Proposition 12.II can now be rephrased (in a somewhat weaker form - we 
now require the expansion of all sets of size up to k) as follows: if G is a (fc, a)-expander, then G has 
a path of length at least ak. For technical reasons, Posa’s Lemma uses the particular case a = 2. 

The idea behind Posa’s approach is fairly simple and natural — one can start with a (long) path 
P, and then, using extra edges, perform a sequence of simple deformations (rotations), till it will be 
possible to close the deformed path to a cycle, or to extend it by appending a vertex outside V{P)] 
then this can be repeated if necessary to create an even longer path, or to close it to a Hamilton cycle. 
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Figure 2: Elementary rotation is applied to a path P to get a new path P' with the same vertex set. 

The approach is thus called naturally the rotation-extension technique. Formally, let P = xqXi .. .xh 
be a path in a graph G = {V,E), starting at a vertex xq. Suppose G contains an edge {xi,Xh) for 
some 0 < i < h — 1. Then a new path P' can be obtained by rotating the path P at Xi, i.e. by 
adding the edge {xi,Xh) and erasing (xi,Xj+i). This operation is called an elementary rotation and 
it depicted in Figure [2l Note that the obtained path P' has the same length h and starts at xq. 
We can therefore apply an elementary rotation to the newly obtained path P', resulting in a path 
P" of length h, and so on. If after a number of rotations an endpoint x of the obtained path Q is 
connected by an edge to a vertex y outside Q, then Q can be extended by adding the edge (x,y). 
The power of the rotation-extension technique of Posa hinges on the following lemma. 

Lemma 2.5 173]/ Let G be a graph, P be a longest path in G and V be the set of all paths obtainable 
from P by a sequence of elementary rotations. Denote by R the set of ends of paths in V, and 
by R~ and R'^ the sets of vertices immediately preceding and following the vertices of R along P, 
respectively. Then Ng{R) C R~ U R~^ . 

Proof Let x G i? and y G V{G) \ {RU R~ U and consider a path Q G P ending at x. If 
y G V{G) \ V{P), then (x,y) 0 E{G), as otherwise the path Q can be extended by adding y, thus 
contradicting our assumption that P is a longest path. Suppose now that y G V{P) \ {RU R~ U R~^). 
Then y has the same neighbors in every path in V, because an elementary rotation that removed 
one of its neighbors along P would, at the same time, put either this neighbor or y itself in R (in 
the former case y G R~ U P+). Then if x and y are adjacent, an elementary rotation applied to 
Q produces a path in V whose endpoint is a neighbor of y along P, while belonging itself to R, a 
contradiction. Therefore in both cases x and y are non-adjacent. □ 

The following immediate consequence of Lemma 12.51 is frequently applied in Hamiltonicity prob¬ 
lems in random graphs. 

Corollary 2.6 Let h, k be positive integers. Let G = {V, E) be a graph such that its longest path has 
length h, but it contains no cycle of length h + 1. Suppose furthermore that G is a {k, 2)-expander. 
Then there are at least non-edges in G such that if any of them is turned into an edge, then 

the new graph contains an {h -\-1)-cycle. 

Proof Let P = xqXi .. .Xh be a longest path in G and let R, R~ ,R~^ be as in Lemma 12.51 Notice 
that \R~\ < \R\ and \R'^\ < |i?| — 1, since Xh & R has no following vertex on P and thus does not 
contribute an element to R'^. 

According to Lemma 12.51 

\Ng{R)\ < \R~ U P+l < 2|P| - 1 , 
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and it follows that |ii| > k. (Here the choice a = 2 in the definition of a (A;, a)-expander plays a 
crucial role.) Moreover, {xq,v) is not an edge for any v G R (there is no {h + l)-cycle in the graph), 
whereas adding any edge (xo,n) for n G i? creates an {h + l)-cycle. 

Fix a subset {yi,... ,yk+i} C R. For every yi G R, there is a path Pi ending at yi, that can be 
obtained from P by a sequence of elementary rotations. Now fix y^ as the starting point of Pi and 
let Yi be the set of endpoints of all paths obtained from Pi by a sequence of elementary rotations. 
As before, \Yi\ > /c + 1, no edge joins yi to F), and adding any such edge creates a cycle of length 
h + 1. Altogether we have found {k + 1)^ pairs {yi, Xij) for Xij G F). As every non-edge is counted at 
most twice, the conclusion of the lemma follows. □ 

The reason we are after a cycle of length /i -|- 1 in the above argument is that if h + 1 = n, 
then a Hamilton cycle is created. Otherwise, if the graph is connected, then there will be an edge e 
connecting a newly created cycle C of length h + 1 with a vertex outside C. Then opening C up and 
appending e in an obvious way creates a longer path in G. 

In order to utilize quantitatively the above argument, we introduce the notion of boosters. 

Definition 2.7 Given a graph G, a non-edge e = {u, v) of G is called a booster if adding e to G 
creates a graph G', which is Hamiltonian or whose longest path is longer than that of G. 

Note that technically every non-edge of a Hamiltonian graph G is a booster by definition. 

Boosters advance a graph towards Hamiltonicity when added; adding sequentially n boosters 
clearly brings any graph on n vertices to Hamiltonicity. 

We thus conclude from the previous discussion: 

Corollary 2.8 Let G be a connected non-Hamiltonian {k, 2)-expander. Then G has at least 
boosters. 

3 Long paths in random graphs 

In this section we treat the appearance of long paths and cycles in sparse random graphs. We 
will work with the probability space G{n,p) of binomial random graphs, analogous results for the 
sister model G{n, m) can be either proven using very similar arguments, or derived using available 
equivalence statements between the two models. 

Our goal here is two-fold: we first prove that already in the super-critical regime p = the 
random graph G{n,p) contains typically a path of length linear in re; then we prove that in the 
regime p = ^, the random graph G{n,p) has typically a path, covering the proportion of vertices 
tending to 1 as the constant C increases. We will invoke the approaches and results developed in 
Section [2.21 Ithe Depth First Search Algorithm and its consequences) to achieve both of these goals, 
in fact in a rather short and elegant way. 
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3.1 Linearly long paths in the supercritical regime 

In their groundbreaking paper [6] from 1960, Paul Erdos and Alfred Renyi made the following 
fundamental discovery: the random graph G{n,p) undergoes a remarkable phase transition around 
the edge probability p(n) = 1. For any constant e > 0, if p = then G{n,p) has whp all connected 
components of size at most logarithmic in n, while for p = whp a connected component of linear 
size, usually called the giant component, emerges in G{n,p) (they also showed that whp there is 
a unique linear sized component). The Erdos-Renyi paper, which launched the modern theory of 
random graphs, has had enormous influence on the development of the field. We will be able to 
derive both parts of this result very soon. 

Although for the super-critical case p = the result of Erdos and Renyi shows a typical 
existence of a linear sized connected component, it does not imply that a longest path in such a 
random graph is whp linearly long. This was established some 20 years later by Ajtai, Komlos and 
Szemeredi [T]. In this section we present a fairly simple proof of their result. We will not attempt 
to achieve the best possible absolute constants, aiming rather for simplicity. Our treatment follows 
closely that of [12]. 

The most fundamental idea of the proof is to run the DFS algorithm on a random graph G ~ 
G{n,p), constructing the graph “on the fly”, as the algorithm progresses. We first hx the order vr 
on V{G) = [n] to be the identity permutation. When the DFS algorithm is fed with a sequence of 
i.i.d. Bernoulli(p) random variables X = so that is gets its i-th query answered positively if 

Aj = 1 and answered negatively otherwise, the so obtained graph is clearly distributed according to 
G{n,p). Thus, studying the component structure of G can be reduced to studying the properties of 
the random sequence X. This is a very useful trick, as it allows to “flatten” the random graph by 
replacing an inherently two-dimensional structure (a graph) by a one-dimensional one (a sequence 
of bits). In particular (here and later we use the DFS algorithm-related notation of Section 12.2p , 
observe crucially that as long as T 7 ^ 0, every positive answer to a query results in a vertex being 
moved from T to U, and thus after t queries and assuming T 7 ^ 0 still, we have [S' U t/| > Yl\=i 
(The last inequality is strict in fact as the first vertex of each connected component is moved from T 
to U “for free”, i.e., without need to get a positive answer to a query.) On the other hand, since the 
addition of every vertex, but the first one in a connected component, to U is caused by a positive 
answer to a query, we have at time t: IDI < 1 -|- Yll=i 

The probabilistic part of our argument is provided by the following quite simple lemma. 


Lemma 3.1 Let e > 0 be a small enough constant. Consider the sequence X = (Aj))^^ of i.i.d. 
Bernoulli random variables with parameter p. 


1. Let p = Let fc = ^Inn. Then whp there is no interval of length kn in [A], in whieh at 
least k of the random variables Aj take value 1. 


2. Let p = . Let Aq = ^. Then whp 


e£ V - 


e(l+e)n 

2 


< n2/3. 
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Proof 1) For a given interval I of length kn in [N], the sum distributed binomially with 

parameters kn and p. Applying the Chernoff bound to the upper tail of B{kn,p), and then the union 
bound, we see that the probability of the existence of an interval violating the assertion of the lemma 
is at most 


{N — k + l)Pr[B{kn,p) > k] < ■ e 3 < n 


2 


^1 A Inn \ 

3 ^ = 0 ( 1 ) , 


for small enough e > 0. 

2) The sum Xi is distributed binomially with parameters Nq and p. Hence, its expectation is 
XoP = , and its standard deviation is of order y/n. Applying the Chebyshev inequality, 

we get the required estimate. □ 

Now we are ready to formulate and to prove the main result of this section. 


Theorem 3.2 Let e > 0 be a small enough constant. Let G ~ G{n,p). 

1. Let p = Then whp all connected components of G are of size at most ^Inn. 

2. Let p = . Then whp G contains a path on at least ^ vertices. 

In both cases, we run the DFS algorithm on G ~ G{n,p), and assume that the sequence X = 
of random variables, defining the random graph G ~ G{n,p) and guiding the DFS algorithm, satisfies 
the corresponding part of Lemma l3.11 

Proof 1) Assume to the contrary that G contains a connected component C with more than k = 
^ In n vertices. Let us look at the epoch of the DFS when G was created. Consider the moment 
inside this epoch when the algorithm has found the {k + l)-st vertex of G and is about to move it to 
U. Denote A5 = S' n C at that moment. Then |AS U t/| = A:, and thus the algorithm got exactly k 
positive answers to its queries to random variables Xi during the epoch, with each positive answer 
being responsible for revealing a new vertex of G, after the first vertex of G was put into U in the 
beginning of the epoch. At that moment during the epoch only pairs of edges touching AS U U 
have been queried, and the number of such pairs is therefore at most ( 2 ) + — k) < kn. It thus 

follows that the sequence X contains an interval of length at most kn with at least A: I’s inside — a 
contradiction to Property 1 of Lemma 13.11 

2) Assume that the sequence X satisfies Property 2 of Lemma 13.11 We claim that after the first 
2 2 
Nq = ^ queries of the DFS algorithm, the set U contains at least ^ vertices (with the contents of 

U forming a path of desired length at that moment). Observe first that |5| < at time Nq. Indeed, 

if |5| > |, then let us look at a moment t < Nq where IS"! = | (such a moment surely exists as 

vertices flow to S one by one). At that moment \U\ < 1 + Xi < ^ hy Property 2 of Lemma [3.11 

Then |T| = n — \S\ — \U\ > |, and the algorithm has examined all 151 • |r| > ^ > Aq pairs between 

S and T (and found them to be non-edges) — a contradiction. Let us return to time Nq. If |5| < | 
2 

and \U\ < ^ then, we have T 7 ^ 0. This means in particular that the algorithm is still revealing 
the connected components of G, and each positive answer it got resulted in moving a vertex from 
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T to U (some of these vertices may have already migrated further from U to S). By Property 2 of 
Lemma [Q the number of positive answers at that point is at least Hence we have 

|5UC/| > -n2/3. If \u\ < 4^, then |5| > f + ^ - n^/^. All |5||r| > |5| (n-\S\ - 

pairs between S and T have been probed by the algorithm (and answered in the negative). We thus 
get: 


en 

~Y 


e^n 


iV„>|5||u-|S|-^l>(f+ 


en e“n n /q 

n -h n 

2 2 


=-^- 0(e^)n^ > - 

2 20 ^ ^ 2 




en 


(we used the assumption |5| < | in the second inequality above), and this is obviously a contradiction, 
completing the proof. □ 


Let us discuss briefly the obtained result and its proof. First, given the probable existence of a 
long path in G{n,p), that of a long cycle is just one short step further. Indeed, we can use sprinkling 
as follows. Let p = for small e > 0. Write 1 — p = (1 — pi)(l — P 2 ) with p 2 = thus, most of 
the probability p goes into pi > Let now G ~ G{n,p), Gi ~ G{n,pi), G 2 ~ G{n,p 2 ), we can 

represent G = GiD G 2 - By Theorem 13.21 Gi whp contains a linearly long path P. Now, the edges 
of G 2 can be used to close most of P into a cycle - there is -whp an edge of G 2 between the first and 
the last (say) vertices of P. 

The dependencies on e in both parts of Theorem 13.21 are of the correct order of magnitude - for 
p = a largest connected component of G{n,p) is known to be whp of size 0(e“^) logn while for 

p = a longest cycle of G{n,p) is whp of length 0(e^)n (see, e.g.. Chapter 6 of [5]). 

Observe that using a Chernoff-type bound for the tales of the binomial random variable instead 
of the Chebyshev inequality would allow us to claim in the second part of Lemma 13.11 that the 
sum Xi is close to with probability exponentially close to 1. This would show in turn, 

employing the argument of Theorem 13.21 that G(n,p) with p = contains a path of length linear 
in n with exponentially high probability, namely, with probability 1 — exp{—c(e)n}. 

As we have mentioned in Section 12.21 the DFS algorithm is applicable equally well to directed 
graphs. Hence essentially the same argument as above, with obvious minor changes, can be applied 
to the model D{n,p) of random digraphs. It then yields the following theorem: 


Theorem 3.3 Let p = for e > 0 constant. Then the random digraph D{n,p) has whp a 
directed path and a directed cycle of length 0(e^)n. 


This recovers the result of Karp m- 


3.2 Nearly spanning paths 

Consider now the regime p = ^, where C is a (large) constant. Cur goal is to prove that whp in 
G{n,p), the length of a longest path approaches n as G tends to infinity. This too is a classical result 
due to Ajtai, Komlos and Szemeredi [1], and independently due to Fernandez de la Vega [7j. It is 


II 








fairly amusing to see how easily it can be derived using the DFS-based tools we developed in Section 

E2 


Theorem 3.4 For every e > 0 there exists C = C{e) > Q such that the following is true. Let 
G ~ G{n,p), where p = ^. Then whp G contains a path of length at least (1 — e)n. 


Proof Clearly we can assume e > 0 to be small enough. Let k = . By Proposition 12.21 it suffices 

to show that G ~ G{n,p) contains whp an edge between every pair of disjoint subsets of size k of 
V{G). For a given pair of disjoint sets S,T of size IS"! = |T| = k, the probability that G contains 
no edges between S and T is exactly (1 — p)^^ (all pairs between S and T come out non-edges in 
G). Using the union bound, we obtain that the probability of the existence of a pair violating this 
requirement is at most 



n — k\ . .u 2 

(i-p) < 



a-pf 


< 




Recalling the value of k and taking G = ^ guarantees that the above estimate vanishes (in fact 
exponentially fast) in n, thus establishing the claim. □ 


As before, getting a nearly spanning cycle whp can be easily done through sprinkling. 


A similar statement holds for the probability space D{n,p) of random directed graphs, with an 
essentially identical proof. 


4 The appearance of Hamilton cycles in random graphs 

The main aim of this section is to establish the Hamiltonicity threshold in the probability space 
G{n,p), this is the minimum value of the edge probability p{n), for which a random graph G drawn 
from G{n,p) is whp Hamiltonian. By doing so we will prove a classical result of Komlos and 
Szemeredi dU and independently of Bollobas g]. 

Let us start by providing some intuition on where this threshold is expected to be located. It is 
well known that the threshold probability for connectivity in G{n,p) is p = More explicitly, one 
can prove (and we leave this as an exercise) that for any function a;(n) tending to infinity arbitrarily 
slowly with n, if p = then whp G ~ G{n,p) is not connected, whereas for p = 

whp G ~ G{n,p) is connected. Perhaps more importantly, the main reason for the threshold for 
connectivity to be around Inn/n is that precisely at this value of probability the last isolated vertex 
in G{n,p) typically ceases to exist. Of course, the graph cannot be connected while having isolated 
vertices, and this is the easy part of the connectivity threshold statement; the hard(er) part is to 
prove that if p{n) is such that S{G) > 1 whp, then G is whp connected. 

If so, we can suspect that the threshold for Hamiltonicity of G{n,p) coincides with that of non¬ 
existence of vertices of degree at most one, the latter being an obvious necessary condition for 
Hamiltonicity. This is exactly what was proven in min]- Let us therefore set our goal by stating 
first a fairly accessible result about the threshold for S{G) > 2, both in G{n,p) and in G{n,m). 
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Proposition 4.1 Let oj{n) he any function tending to infinity arbitrarily slowly with n. Then: 

• in the probability space G{n,p), 

1. if p{n) = ^ Q G{n,p) whp satisfies 6{G) < 1; 

2. if p{n) = ^ Q G{n,p) whp satisfies 6{G) > 2 ; 

• in the probability space G{n,m), 

1. ifm{n) = inn-i^(n))n ^ Q ^ whp satisfies 6{G) < 1; 

2. if m{n) = q ^ G{n,m) whp satisfies 6{G) > 2 . 

Proof Straightforward application of the first (for proving S{G) > 2) and the second (for proving 
S{G) < 1) moment methods in both probability spaces; left as an exercise. □ 

Hence our goal will be to prove that tor p[n) = - - -^ and tor m[n) = - - 2 - 

the random graphs G{n,p) and G{n,m) respectively are whp Hamiltonian. 

We will actually prove a stronger, and a much more delicate, result about the hitting time of 
Hamiltonicity in random graph processes. Let us first define this notion formally. Let a : E{Kn) —?• 
[A^] be a permutation of the edges of the complete graph Kn on n vertices, we can write a = 
(ei,... ,eAr), where -^ = ( 2 )- ^ graph process G = G{a) is a nested sequence G = (G'i)(^o, where 
the graph Gi has [n] as its vertex set and {ei,..., e^}, the prefix of a of length i, as its edge set. The 
sequence {Gi) thus starts with the empty graph on n vertices, finishes with the complete graph on 
n vertices, and its i-th element Gi has exactly i edges; moreover, it is nested, as for i > 1 the graph 
Gi is obtained from its predecessor Gi-i by adding the i-th edge e* of a. We can view G{a) as a 
graph process (as the name indicates suggestively) or as an evolutionary process, unraveling from 
the empty graph to the complete graph, as guided by a. 

Now, we introduce the element of randomness in the above definition. Suppose the permutation 
a is drawn uniformly at random from the set of all A^! permutations of the edges of Kn- Then the 
corresponding process G{a) is called a random graph process. We can describe it in the following 
equivalent way: start by setting Go to be the empty graph on n vertices, and for each 1 < i < A^, 
obtain Gi by choosing an edge e* of Kn missing in Gj-i uniformly at random and adding it to Gj_i. 
This very nice and natural probability space models a random evolutionary process in graphs; here 
too we proceed from the empty graph to the complete graph, but in a random fashion. 

Random graph processes are so important not just because they model evolution very nicely; 
in fact, they embed the probability spaces G{n,m) for various m; due to standard connections 
between G(n, m) and G{n,p) one can also claim they “contain” G{n,p) as well. Observe that running 
a random process G and stopping it (or taking a snapshot) at time m produces the probability 
distribution G{n,m). Indeed, every graph G with vertex set [n] and exactly m edges is the m-th 
element of the same number of graph processes, namely, of m\{N — m) \ of them. Thus, understanding 
random graph processes usually leads to immediate consequences for G{n,m), and then for G{n,p), 
and Hamiltonicity is not exceptional in this sense. 
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Let P be a property of graphs on n vertices; assume that P is monotone increasing (i.e., adding 
edges preserves it), and that the complete graph possesses P (one can think of P as being the 
property of Hamiltonicity). Then, given a permutation a : E{Kn) —)• [N] and the corresponding 
graph process G{a), we can define the first moment i when the i-th element Gi of G has P. This is 
the so called hitting time of P, denoted by T-p{G{a))-. 

T'p{G{a)) = min{i > 0 : has P} . 

Of course, due to the monotonicity of P from this point till the end of the process the graphs Gi 
all have P. When G is a random graph process, the hitting time T'p[G) becomes a random variable, 
and one can study its typical behavior. A related task is to compare two hitting times, and to try 
to bundle them, deterministically or probabilistically. 

We now state the main result of this section, due to Ajtai, Komlos and Szemeredi [2], and to 
Bollobas [1]. 

Theorem 4.2 Let G be a random graph proeess on n vertices. Denote by T 2 (G) and Ty^{G) the hitting 
times of the properties of having minimum degree at least 2, and of Hamiltonicity, respectively. Then 

whp." 

T2iG) = Tn(G). 

In words, for a typical graph process, Hamiltonicity arrives exaetly at the very moment the last 
vertex of degree less than two disappears. Of course, it cannot arrive earlier deterministically, so the 
main point of the above theorem is to prove that typically is does not arrive later either. 

As we indicated above, random graph process results are usually more powerful than those for 
concrete random graph models. Here too we are able to derive the results for G{n,m) and G{n,p) 
easily from the above theorem. 

Corollary 4.3 Let m{n) = (^°^Hninn+a;(n))n ^ g, random graph G ~ G{n,m) is whp Hamilto¬ 

nian. 

Proof Generate a random graph G distributed according to G{n, m) by running a random graph 
process G and stopping it at time m. By Proposition 14.11 we know that T 2 (G) < m. Theorem 14.21 
implies that typically T 2 (G) = T'n{G), and thus the graph of the process has become Hamiltonian 
not later than m. Hence G is whp Hamiltonian as well. □ 

Corollary 4.4 Letp{n) = ^^’^HninTi+a;(n) ^ ^ random graph G G{n, m) is whp Hamiltonian. 

Proof Observe that generating a random graph G ~ G{n,p) and conditioning on its number of 
edges being exactly equal to m produces the distribution G{n,m). Let a;i(n) = w(n)/3. Denote 
I = [Np — nuJi{n),Np + nwi(n)]. Observe that for every m & I, the random graph G ~ G(n, m) is 
whp Hamiltonian by Corollary 14.31 Also, the number of edges in G{n,p) is distributed binomially 
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with parameters N and p and has thus standard deviation less than ^/Np <C na;i(n). Applying 
Chebyshev we derive that whp \E{G)\ G I. Hence 

N 

Pr[G ~ G{n,p) is not Hamiltonian] = Pr[\E{G)\ = m] • Pr[G is not Hamiltonian| \E(G) = m] 

m=0 

< Pr[\E{G)\ 0 /] + Pr[\E{G)\ = m] Pr[G is not Hamiltonian| \E{G) = m] 

m&I 

= o{i) + E Pr\\E{G)\ = m] Pr[G ~ G{n,m) is not Hamiltonian| = o(l) • Pr[Bin{N,p) G /] 

mGl 

= o(l). 


□ 

Now we start proving Theorem 14.21 The proof is somewhat technical, so before diving into its 
details, we outline its main idea briefly. Recall that our goal is to prove that for a typical random 
process G, we have T 2 (G) = r-^(G). In order to prove this, we will take a very close look at the 
snapshot Gj-j of G, aiming to prove that this graph is whp Hamiltonian. By definition, the minimum 
degree of is exactly two, and it is thus quite reasonable to expect that this graph is typically a 
(/c, 2)-expander for k = 0(n). This is true indeed, however such expansion by itself does not quite 
guarantee Hamiltonicity. As indicated in Section 12.31 expanders form a very convenient backbone 
for augmenting a graph to a Hamiltonian one — according to Corollary 12.81 every connected non- 
Hamiltonian (fc, 2)-expander has Q{k‘^) boosters. Observe though that since we aim to prove a hitting 
time result, we cannot allow ourselves to sprinkle few random edges on top of our expander — a 
Hamilton cycle should appear at the very moment the minimum degree in the random graph process 
becomes two. We will circumvent this difficulty in the following way: we will argue that the snapshop 
Gt 2 typically is not only a good expander by itself, but also contains a subgraph Tq which is about 
as good an expander as G^-j is, but contains only a small positive proportion of its edges. Having 
obtained such Tq, we will start looking for boosters relative to Tq, but already contained in our 
graph Gt -2 - thus avoiding the need for sprinkling. We will argue that G^-j is typically such that it 
is contains a booster with respect to every sparse expander in it. If this is the case, then we will be 
able to start with Tq and to update it sequentially by adding a booster after a booster (at most n 
boosters will need to be added by definition), until we will finally reach Hamiltonocity - all within 
Gt-j; observe crucially that at each step of this augmentation procedure the updated backbone Tj, 
obtained by adjoining to Tq the previously added boosters, has at most n more edges than Tq and 
is thus still a sparse subgraph of G^j; of course the required expansion is inherited from an iteration 
to iteration. Then our claim about G^-j typically containing a booster with respect to every sparse 
expander within is applicable, and we can push the process through. This is quite a peculiar proof 
idea - it appears that the random graph is helping itself to become Hamiltonian! 

Let us get to work. As outlined before, we run a random graph process G and take a snapshot 
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at the hitting time T 2 = T 2 {G). Denote 


nlnn 
m 2 = re In n . 


Observe that by Proposition 14.11 we have that whp mi < T 2 < m 2 - Let 

do = [<^0 In rej 


where do > 0 is a sufficiently small constant to be chosen later, and denote, for a graph G on re 
vertices, 

SMALL{G) = {vG V{G) : d{v) < do} ■ 

Observe that for G ~ G{n,m) with m > mi, the expected vertex degree is asymptotically equal 
to In re. Thus falling into SMALL[G) is a rather rare event, and we can expect the vertices of 
SMALL{G) to be few and far apart in the graph. In addition, such G should typically have a very 
nice edge distribution, with no small and dense vertex subsets, and many edges crossing between 
any two large disjoint subsets. This is formalized in the following lemma. 


Lemma 4.5 Let G = {Gi)^Q be a random graph process on re vertices. Denote G = Gt-j, where 
T 2 = T 2 {G) is the hitting time for having minimum degree two in G. Then whp G has the following 
properties: 


(PI) A{G) < 10 In re; 6{G) > 2; 

(P2) \SMALL{G)\ < reO'3; 

(P3) G does not contain a non-empty path of length at most 4 such that both of its (possibly identical) 
endpoints lie in SMALL[G); 


(P4) every vertex subset U C [re] of size \U\ < ^ spans at most \U\ ■ In^^^re edges in G; 

(P5) for every pair of disjoint vertex subsets U,W of sizes \U\ < ^^1/2 ^ ; |bL| < \U\ ■ In^^^ re, the 
number of edges of G crossing between U and W is at most 


(P6) for every pair of disjoint vertex subsets U, W of size \U\ = |IT| 
edges between U and W. 


n 


G has at least 0.5re 


Proof The proof is basically a fairly standard (though tedious) manipulation with binomial coef¬ 
ficients. We will thus prove several of the above items, leaving the proof of remaining ones to the 
reader. 

(PI): Observe that since whp T 2 < m 2 , it is enough to prove that in G ~ G(re, m 2 ) there are whp 
no vertices of degree at least 10 In re. For a given vertex u G [re], the probability that v has degree at 
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least 10 In re in G{n,m 2 ) is at most 

/ n - 1 \ ( ^ / ere yoinn lOlnn 

VlOlnrey “ VlOlnre/ \n) 

by the standard estimates on binomial coefficients stated in Section 12.1.21 After cancellations we 
see that the above estimate is at most (ere/5(re — = o(l/re). Applying the union bound we 

obtain that typically at time m 2 , and thus at T 2 < m 2 as well, there are no vertices of degree at least 
10 In re. The bound on 5{G) is immediate from the definition of T 2 . 

(P2): Notice that since adding edges can only decrease the size of SMALL{G), it is enough to prove 
that typically already at time rrei \SMALL{Gm-i)\ Let G ~ G(re, rre-i). If|5MALL(G)| > re*^'^, 

then G contains a subset Vq C F, |Vb| = A: = [re^'^] such that eG(Vo, V — Vq) < d^k. The probability 
of this to happen in G{n,mi) is at most: 




N—k{n—k)\ /N—i\ 

mi—i ) \rai—i) 



< 



E 

iKdgk 


ekn\^ -i)(k(n-k)-i) 

- • e 

* / 



O.Bkm^^n 

• e ^ 



E {dok + 1) 


3re°-7 


31nre y° 0.8 In 

do J 


0 ( 1 ) > 


for 5o small enough. 

(P3): Since whp mi < T 2 < m 2 , it is enough to prove the following statement:whp every two 
(possibly identical) vertices of SMALL{Gmi) are not connected by a path of length at most 4 in 
Gm2 ■ 

Let us prove first that whp there is no such path in Gmi ~ G{n,mi). We start with the case 
where the endpoints of the path are distinct. Fix 1 < r < 4, a sequence P of distinct vertices 
Vo, - ■■ ,Vr in [re] and denote by Ap the event {vi, Ui+i) G E{Gmi every 0 < i < r — 1. Then 


Pr[Ap] 




In re 


re — 1 


r 


If we now condition on Ap, then the two edges (uo,ui) and {vr-i,Vr) are present in Gmi- Thus in 
order for both vo,Vr to fall into SMALL{Gmi), out of 2 re — 4 potential edges between {uoj^^r} and 
the rest of the graph (the edges (uq, ui), {vr-i,Vr) are excluded from the count), only at most 2do — 2 
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are present in Gmi ■ Hence: 


Pr[vo,Vr G SMALL{Grm)\Ap] 


< 


2dQ —2 

E 

i=0 


2n-4\ ( 


fN—r—2n-\-A\ 
mi—r—i ) 


< (24-1) 


2n — 4 

24 — 2 


/ N-r \ 

\mi—r) 

/ N—r—2n+4 \ 
Vmi—1—2^0+2/ 


E 24 


E 24 ■ 


E 24 ■ 


2n - 4 \ (, 


2d{) — 2 


/ N—r \ 
\mi—rJ 
N—r—2n-\-4 \ 
mi—r—2f/o+2/ 


en 


/ N—r—2do+2 ^ 
Vmi—r—2 
2dQ —2 


-2t/Q-|-2/ 


/ N—r—2dQ-\-2 \ 
Vmi—r— 2 do+ 2 / 
/ N—r \ 
Vmi—r/ 


4 — 1 

emin V 

(4 - i)n) 


(mi — r — 2dQ+2) (2n — 2dQ —2) 
— r —2dQ+2 


2dQ —2 


< n 


-i.g 


\ 2dQ—2 

N-r ) 


for 4 small enough. Hence, applying the union bound over all such sequences of r + 1 vertices, we 
conclude that the probability that there exists a path in G^i of length at most 4, connecting two 
distinct vertices from SMALL{Gmi) ^1 most = o(l). The case where 

the endpoitns of the path are identical is treated similarly. 

In light of the above, we can assume that after mi steps of the random graph process the 
current graph does not have a forbidden short path between the vertices of SMALL. Moreover, 
by (P2) we can assume that \SMALL{Gmi)\ E Now, let us run the process between mi 

and m2. In order for the i-th edge of the process, mi < i < m2, to close a short path between 
the vertices of SMALL{Gmi), it should fall inside a current set U of vertices at distance at most 
3 from SMALL{Gmi)- We have proven (property (PI)) that in fact whp the maximum degree 
of Gm 2 a-s well is at most 10Inn. Hence, whp in this time interval, the set U has size at most 

\SMALL{Gmi \ ■ (lOlnn)^, and thus the probability of the i-th edge of the process to fall inside U 

nu\\ 

is at most = o(n“^'^). Taking the union bound over all such i in the interval (mi,m2], we 

establish the desired property. 

Properties (P4)-(P6) can be proven quite similarly (and are in fact simpler to prove), and we 
spare the reader from the (perhaps somewhat boring...) task of reading their proofs. 

□ 


The above stated properties (PI) (P6) are sufficient to prove that is a very good expander 
by itself. Our goal is somewhat different though ~ we aim to prove that G7-2 contains a much sparser, 
but still fairly good expander. For this purpose, assume that a graph G = (F, E) has properties 
(P1)-(P6). Form a random subgraph Fq of G as follows. For every v £ V — SMALL{G), choose 
a set E{v) of 4 edges of G incident to v uniformly at random; for every v G SMALL{G), define 
E{v) to be the set of all edges of G touching v. Finally, define Fq to be the spanning subgraph of G, 
whose edge set is: 

i?(ro) = lJ^(^')- 
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In words, in order to form Fq we retain all edges touching the vertices of SMALL{G), and sparsify 
randomly other edges. 

Lemma 4.6 With high probability (over the choices of E{v)) the subgraph Fq is a {k, 2)-expander 
with at most don edges, where k = j. 

Proof Since by dehnition \E{v)\ < do for every v ^ V, \i follows immediately that |i?(Fo| < don. 
We now prove that typically Fq has the following property: 

(P7) For every pair of disjoint sets U,W of size \U\ = \W\ = ^ has at least one edge 

between U and W. 


Fix sets U, W as above. We know by (P6) that G has at least 0.5n edges between U and W. For 
a vertex u G U, the probability that none of the edges between u and W falls into E{u) is at most 


(dG{u)-dG{u,^ 
\ dp 


dQ-d(^{u,W) 


< p dQ{u) g 10 In 




by (PI). Hence the probability that none of the vertices u from U chooses an edge between u and 
W to be put into its set E{u) is at most: 


^ ^-^.eGiU,W) ^ g-e(n) 


Applying the union bound over all choices of U, W gives the desired claim. 

We now claim that for every graph G, and every its subgraph Fq of minimum degree 2 satisfying 
properties (P2), (P3), (P4), (P5), (P7), the subgraph Fq is an (n/4,2)-expander. In order to 
verify this claim, let S C [n] be a subset of size |5| < n/4. Denote Si = S Ci SMALL{G), S 2 = 
S — SM ALL{G). Consider first the case where |S'2| < ^ ■ Since (5(Fo) > 2, and all vertices from 

SMALL are at distance more than 4 from each other by (P3), we obtain: |Aro(S'i)| > 2|5i|. As for 
vertices from S 2 , they are all of degree at least do in Fq. The set S 2 spans at most |52| - n edges in 
G, and thus in Fq, according to (P4). It thus follows that ero(S'2, V—S 2 ) > (io|'S'2| —2ero(S'2) > ^4^^. 
Hence |A^ro(5'2)| > |-S'2| • ln^/"^n, by (P5). Finally, notice that, due to the non-existence of short 
paths connecting SMALLiG) again, the set Si U A^ro('S'i) contains only one vertex from uU A^ro('*^) 
for every u G S 2 (here we use the fact the forbidden paths have length at most 4). Therefore, 
|(Si U Nr,{Si)) n (52 U A^ro(52))| < |52|. Altogether, 


\Nr,{S)\ = |iVro(52)\5i| + |iVr„(5i)-(52UiVro(52))| 

> |52|(lni/4n-l) + 2|5i|-|52| >2(|5 i| + |52|) = 2|5|. 


as required. The complementary case ^Jj 2 ^ ^ l'S2| < f is very simple: by property (P7), such 
S 2 misses at most n/ln^/^n vertices in its neighborhood in Fq, also |Si| < \SMALL{G)\ < n^'^ by 
(P2). In follows that |A^ro(-5)| >n- - |52| - |5MALL(G)| > §. □ 
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Notice that every {j, 2)-expander F onn vertices is necessarily connected. Indeed, if such F is not 
connected, then consider its connected component C of size \C\ < and take U to be an arbitrary 
subset of C of size \U\ = min{ , ICI}. Then the external neighborhood of C/ in F has size at least 
2\U\ > \C — U\ by our expansion assumption, and falls entirely within C — a contradiction. 

As we have stated already in this text, expanders are not necessarily Hamiltonian themselves, but 
they are amenable to reaching Hamiltonicity by adding extra (random) edges, as they contain many 
boosters. However, in our circumstances we do not have extra time for sprinkling, and the required 
boosters should come from within the already existing edges of the random graph. Fortunately, a 
random graph G{n, m) with m = m{n) in the relevant range has whp a booster with respect to any 
sparse expander it contains, as given by the following lemma. 

Lemma 4.7 Let G = {Gi)f^Q he a random graph process on n vertices. Denote G = where 
T 2 = T 2 {G) is the hitting time for having minimum degree two in G. Assume the constant do is small 
enough. Then whp for every {n/A, 2)-expander T <Z G with H(F) = V{G) and |i?(F)| < don + n, F 
is Hamiltonian, or G contains at least one booster with respect to F. 


Proof Recall that every connected {k, 2)-expander F is Hamiltonian or has at least k'^/2 boosters, 
by Corollary 12.81 In order for a random graph G to violate the assertion of the lemma, G should 
contain some (re/4, 2)-expander F with few edges, but none of at least as many as re^/32 boosters 
relative to F (note that the required connectivity of G is delivered by the expansion of F, as explained 
above). Since we cannot pinpoint the exact location of T 2 , we instead take the union bound over all 
rrei < rre < m 2 , as whp T 2 is located in this interval. So the estimate is: 

m2 (N\ _ 

E E A-r* G o(i) (4) 

m=m\ i<dQn-\-n \mJ 


(we sum over all relevant values of m, adding o(l) in the end to account for the probability that T 2 
falls outside the interval [rrei,rre2]; then we sum over all possible values i of |ii'(F)|; then we bound 
from above by (^) the number of (re/4, 2)-expanders with i edges in the complete graph on re vertices, 
and finally we require the edges of F to be present in G(n, rre), but all at least re^/32 boosters relative 
to F to be omitted). The ratio of the binomial coefficients above can be estimated as follows: 




■i- — \ 

32 ) 
T—2 / 




(m —i) 


(iV) 

\mJ 


< 


—t) 


■N-i 
.m 


(iV) 

\m/ 


< 


assuming that (5o in the definition of do is small enough. We can thus estimate the r-th summand in 
Q as follows: 




__m 

• e 17 



_rn 

e 17 < 


erre 


don + re 


don-\-n 

_ 

■ e 17 


o{n 3), 


again for do small enough (it is even exponentially, and not just polynomially, small in re). Summing 
over all i < don + re and then over all mi < m < m 2 establishes the required claim. □ 
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The stage is now set to deliver the final punch of the proof of Theorem 14.21 Recall that our goal 
is to prove that for a random graph process G, whp the graph G = at the very moment T 2 when 
the minimum degree becomes 2 is already Hamiltonian. First, observe that by Lemma 14.61 who G 
contains an (n/4, 2)-expander Tq with at most don edges. We start with this sparse expander Tq 
and keep adding boosters to it until the current graph Tj becomes Hamiltonian; obviously at most 
n steps (edge additions) will be needed to reach Hamiltonicity. If we ever get stuck before reaching 
Hamiltonicity, say at step f > 0, then the current graph Tj is still an (n/4, 2)-expander, is connected 
and non-Hamiltonian, has at most don + n edges, but the graph G has no boosters with respect to 
Tj. This however does not happen typically due to Lemma 021 If so, the process of edge addition 
eventually completes with a subgraph Tj C G, which is Hamiltonian. The proof is complete! 
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